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Abstract 



We show that the left (right) sample quantile tends to the left (right) distribution quantile 
at p £ [0, 1], if the left and right quantiles are identical at p. We show that the sample 
quantiles diverge almost surely otherwise. The latter can be considered as a generalization 
of the well-known result that the sum of a random sample of a fair coin with 1 denoting 
heads and -1 denoting tails is infinitely often. In the case that the sample quantiles 
do not converge we show that the limsup is the right quantile and the liminf is the left 
l~j ■ quantile. 

»-Q ' Keywords: quantile function; convergence; divergence; limit 
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1 Introduction 

The traditional definition of quantiles for a random variable X with distribution 

function F, 
00 . 
[> . kxip) = inf{x|F(x) > p}, 



appears in classic works as PJ. We call this the "left quantile function". In some 
q ■ books (e.g. [5]) the quantile is defined as 



rqx(p) = mi{x\F(x) > p} = sup{x\F(x) < p}, 

this is what we call the "right quantile function". Also in robustness literature 
people talk about the upper and lower medians which are a very specific case of 
these definitions. Hosseini in [3] considers both definitions, explore their relation 
and show that considering both has several advantages. 

Section |2] studies the limit properties of left and right quantile functions. In 
Theorem 12.51 we show that if left and right quantiles are equal, i.e. IqF^p) = r( lF(p), 
then both sample versions lqp n , rqp n are convergent to the common distribution 
value. We found an equivalent statement in Serfling [6] with a rather similar proof. 



2 Limit theory 



The condition for convergence there is said to be /(?f(p) being the unique solution 
of F(x—) < p < F(x) which can be shown to be equivalent to Zgp(p) = T( 1f{p)- 
Note how considering both left and right quantiles has resulted in a cleaner, more 
comprehensible condition for the limits. In a problem Serfling asks to show with an 
example that this condition cannot be dropped. We show much more by proving 
that if lqF(p) 7^ t <If{p) then both rq Fn {j>) and rq Fn {j>) diverge almost surely. The 
almost sure divergence result can be viewed as an extension to a well-known result 
in probability theory which says that if Xi, X2, ■ ■ ■ an i.i.d sequence from a fair coin 
with -1 denoting tail and 1 denoting head and Z n = Y^=i Xi then P(Z n = i.o.) = 
1. The proof in [2] uses the Borel-Cantelli Lemma to get around the problem of 
dependence of Z n . This is equivalent to saying for the fair coin both lqp n (l/2) and 
rg^ n (l/2) diverge almost surely. For the general case, we use the Borel-Cantelli 
Lemma again. But we also need a lemma (Lemma 12. ip which uses the Berry-Esseen 
Theorem in its proof to show the deviations of the sum of the random variables can 
become arbitrarily large, a result that is easy to show as done in [2] for the simple fair 
coin example. Finally, we show that even though in the case that lqF{p) 7^ r qF{p), 
lqF„, r qF n are divergent; for large ns they will fall in 

(lq F (p) ~ e, kF{p)] U [rq F (p),rq F (p) + e). 
In fact we show that 

lim inf lq Fn (p) = lim inf rq Fn (p) = lq F (p) 

n— >oo n— >oo 

and 

lim sup lq Fn (p) = lim sup rq Fn (p) = rq F (p). 

The proof is done by constructing a new random variable Y from the original random 
variable X with distribution function Fx by shifting back all the values greater than 
rqx{p) to lqx(p)- This makes lqy{p) = vqyip) in the new random variable. Then 
we apply the convergence result to Y. 

2 Limit theory 

To prove limit results, we need some limit theorems from probability theory that 
we include here for completeness and without proof. Their proofs can be found in 
standard probability textbooks and appropriate references are given below. If we are 
dealing with two samples, X\, • • • , X n and Yi, • • • , Y n , to avoid confusion we use the 
notation F Uj x and F n y to denote their empirical distribution functions respectively. 



Definition 2.1: Suppose X\,X2, ■ ■ ■ , is a discrete-time stochastic process. Let F(X) 
be the cr-algebra generated by the process and J-"(X n , X n+i , • • ■ ) the a-algebra gener- 
ated by X n , X n+1 , ■ ■ ■ . Any E G F(X) is called a tail event if E e ^(Xn, X n+1 , ■ ■ ■ ) 
for any nGN. 

Definition 2.2: Let {A n } n( z?q be any collection of sets. Then {A n i.o.}, read as A n 
happens infinitely often is defined by: 

{A n i.o.} = n ieN U°° =i A j . 

Theorem 2.1: (Kolmogorov 0-1 law): 

E being a tail event implies that P(E) is either or 1. 

Proof See [2]. ■ 

Theorem 2.2: (Glivenko-Cantelli Theorem): 

Suppose, Xi,X 2 , ■ ■ • , i.i.d, has the sample distribution function F n . Then 

lim sup \F n (x) — F(x)\ — > 0, a.s.. 
Proof See p]. ■ 



Theorem 2.3: (Borel-Cantelli lemma): 
Suppose (Q, J 7 , P) is a probability space. Then 

1. A n e 7 and £~ P(A n ) < oo =>• P(A n i.o) = 0. 

2. A n E J 7 independent events with ^J°-P(Ai) = oo =>• -P(A n i.o) = 1, where 
i.o. stands for infinitely often. 

Proof See 0. ■ 

Theorem 2.4: (Berry-Esseen bound): Let X 1 ,X 2 , ■ ■ ■ , be i.i.d with E(Xi) = < oo, 
E(X?) = o and £(|X;| 3 ) = p. If G n is the distribution of 

X x + ■ ■ ■ + X n /ay/n 

and $(x) is the distribution function of a standard normal random variables then 

\G n (x)-Q(x)\ <3p/a 3 v^. 
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Corollary 2.1: Let Xi,X 2 , • • ■ , be i.i.d with E(Xi) — /j, < oo, #(|Xj — /u| 2 ) = cr and 

EdXj-^l 3 ) = p. If G n is the distribution of (X l H \-X n -nfi)/(r-s/n = a/^^t^) 

and $(x) is the distribution function of a standard normal random variable then 

\G n (x)-$(x)\ <3p/a 3 Vn. 

Proof This corollary is obtained by applying the theorem to Yi = Xi — fi. ■ 

Now let A n = (Xi + ■ ■ • + X n — nfi)/ay/n. Then 



\P(A n > x) - (1 - $(x))| = |P(A, < x) - $(x)| = |G n (x) - $(x)| < 3p/a 3 v^. 
Also 

\P{x <A n <y)- (<%) - $(x)))| < |G n (y) - $(y)| + \G n (x) - $(x)| < 6p/cx 3 v^. 
These inequalities show that for any e > there exist N such that n > N, 

$(2 2 ) - *{ Zl ) - e < P(z x < v^(^^^) < z 2 ) < $(z 2 ) - 9( Zx ) + e, 

a 

for z\ < z 2 G R U {— oo, oo}. 

It is interesting to ask under what conditions lq Fn and rq Fn tend to lq F and 
rg^ as n — > oo. Theorem 12.51 gives a complete answer to this question. 

Theorem 2.5: (Quantile Convergence/Divergence Theorem) 

a) Suppose rq F (p) = lqp(p) then 

rqF n (p) ~>rq F (p), a.s., 

and 

kF n (p) ->kF(p), as- 

b) When IqFip) < rq F ip) then both rq Fn (p),lq Fn (p) diverge almost surely. 

c) Suppose IqFip) < T( 1f(p)- Then for every e > there exists N such that n > N, 

kF n ip),rq Fn (p) e (IqFip) ~ e,lq F (p)] U [rq F (p),rq F (p) + e). 



d) 

lim sup lq Fn (p) = lim sup rg^ (p) = rq F (p), a.s., 

and 

lim inf lq Fn (p) = lim inf rq Fn (p) = rq F {p), a.s.. 

Proof 

a) Since, lq F {p) = rq F (p), we use o,f(p) to denote both. Suppose e > is given. 
Then 

F(qf(p) - e) < p =>• F(q F (p) -e)=p-8 1 , 8 ± > 0, 

and 

^Mp) + e) > P => ^Mp) + e) = p + <5 2 , 5 2 > 0. 

By the Glivenko-Cantelli Theorem, 

F n {u) —¥ F{u) a.s., 
uniformly over R. We conclude that 

F n (qF(p) -£)->■ F{q F {p) -e)=p-6 1 , a.s., 

and 

F n (qF(p) + e) -»• F(q F (p) + e) = p + <5 2 , a.s.. 

Let e' = min( ^ 1 ' <52) . Pick iV such that for n > iV : 

p - <$i - e' < F n (q F (p) -e) <p-8 1 + e', 

p + 8 2 -e' < F n (q F (p) + e) <p + 8 2 + e'. 



Then 



F n (qF(p) - e) < p - <$i + e' < p =► 

^f„(p) > Of(p) - e and rg Fn (p) > g F (p) - e. 
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Also 

p < p + 5 2 - e' < F n (q F (p) + e) =► 

kF n (p) < Qf(p) + e and rq Fn (p) < q F (p) + e. 
Re-arranging these inequalities we get: 

Qf(p) ~ e < kF n (p) < Qf(p) + e, 
and 

qp(p) ~ e < rq Fn (p) < qp(p) + e. 

b) This needs more development in the sequel and the proof follows. 

c) This also needs more development in the sequel and the proof follows. 

d) If lqp{p) = rq F {p) the result follows immediately from (a). Otherwise sup- 
pose IqFip) < rq F (p). Then by (b) lqF n (p) diverges almost surely. Hence 
lim sup lq Fn (p) ^ liminf lqF n (p), o,.s. . But by (c), Ve > 0, 3iV, n> N 

l QF n (p) e (IqFip) ~ e,lq F (p)] U [rq F (p),rq F (p) + e). 
This means that every convergent subsequence of lq Fn (p) has either limit lqF{p) or 
rq F (p), a.s.. Since lim sup lqF„(p) 7^ liminf lqF n (p), o. s., we conclude lim sup lqF„(p) 
rq F {p) and liminf lqF n {p) = lqF(p), os.. 
A similar argument works for rq Fn (p). 



To investigate the case IqFip) ¥" r qF(p) more, we start with the simplest 
example namely a fair coin. Suppose X±,X2,--- an i.i.d sequence with P{Xi = 
-1) = P(Xi = 1) = | and let Z n = YTi=i X i- Note that 

Z n < & lq Fn (l/2) = -1, Z n > & lq Fn (l/2) = 1, 

and 

Z n < & rq Fn (l/2) = -1, Z n > & rq Fn (l/2) = 1. 

Hence in order to show that lq Fn (l/2) and lq Fn {l/2) diverge almost surely, we only 
need to show that P((Z n < i.o.) fl (Z n > i.o.)) = 1. We start with a theorem 
from [2]. 



Theorem 2.6: Suppose X{ is as above. Then P(Z n = i.o.) = 1. 

Proof The proof of this theorem in [2] uses the Borel-Cantelli Lemma part 2. ■ 

Theorem 2.7: Suppose, X h X 2 , ■ ■ ■ i.i.d. and P(X { = -1) = P(X { = 1) = 1/2. Then 
lqF n {l/2) and rgi? n (l/2) diverge almost surely. 

Proof Suppose, A = {Z n = —1 i.o.} and P = {Z n = 1 i.o.}. It suffices to show 
that 

P(AnB) = l. 

But u; G A fl P =>- lq Fn {p){uj) = —1, i.o. and lq Fn (p)(u) = 1, i.o. Hence lq Fn (p)(uj) 
diverges. 

Note that -P(^4) = P{B) by the symmetry of the distribution. Also it is 
obvious that both A and B are tail events and so have probability either zero or 
one. To prove P(A fl B) = 1, it only suffices to show that P(A U5) > 0. Because 
then at least one of A and B has a positive probability, say A. 

P(A) > => P(A) = 1 => P(S) = P(A) = 1 => P(A n B) = 1. 

Now let C = {Z n = 0, i.o.}. Then P(C) = 1 by Theorem EU If Z n (u) = then 
either Z n+ i(u>) = 1 or Z n+ i(u>) = — 1. Hence if Z n (o;) = 0, i.o. then at least for one 
of a = 1 or a = — 1, Z n (u) = a, i.o.. We conclude that w G i U B. This shows 

CcAUB^P(AU5) = l. ■ 

To generalize this theorem, suppose Xi,X 2 , ■ • • , arbitrary i.i.d process and 
Iqf(p) < T( 1f{p)- Define the process 

Y fl X 4 >rg F (p) 
4 [0 X 4 </g F (p). 

(Note that P(lq x {p) < X < rq x (p)) = 0.) Then the sequence Yi,Y 2 ,--- is i.i.d., 
P(^ = 0) = p and P(T; = 1) = 1 - p. Also note that 

lqF nY (p) diverges a.s. =3- lqp nX (p) diverges a.s. 

Hence to prove the theorem in general it suffices to prove the theorem for the Yi 
process. However, we first prove a lemma that we need in the proof. 
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Lemma 2.1: Let Y 1 ,Y 2 , ■ ■ ■ i.i.d with P(Yi = 0) = p = 1 - q > and P(F; = 1) = 
1—p = q > 0. Let S* n = Y17=i Yi, < a, A; G N. Then there exists a transformation 
0(/c) (to N) such that 

^(^( fc ) - <t>{k)q < -k) > 1/2 - a, 

P(S m - <j>{k)q >k)> 1/2 - a. 

Remark. For a = 1/4, we get 

P(S m - <P(k)q < -k) > 1/4, 

P(S m - (j>{k)q >k)> 1/4. 

Proof Since the first three moments of Yi are finite (E(Y,j) = q,E(\Yi — q\ 2 ) = 
q(l — q) = a, E(\Yi — q\ 3 ) = q 3 (l — q) + (l — q) 3 q = p), we can apply the Berry-Esseen 
theorem to y/n n ~ M . By a corollary of that theorem, for f > there exists an JVi 
such that 

1 - *(*) - % < P(yfi£^± >z)<\- *{z) + % 



and 






for all z E R and n > N\. Now for the given integer k pick iV 2 such that 

1 a k 1 a 
< $( — — ) < - + -. 

2 2 Vv^Va 7 2 2 

This is possible because $ is continuous and $(0) = 1/2. Now let 

k 
<f>(k) =max{JVi,JV 2 }, z 



ay/m' 
Then since (f>(k) > iVi 

P( v 7 ^)^^ > *) > 1 - *(*) - | > 1/2 - a, 



and 

a 
These two inequalities are equivalent to 



P(^m m - < -z) > Hz) - - > 1/2 - a. 



P((S m - 4>{k)q) < -k) > 1/2 - a, 

and 

P((S m - 4>(k)q) >k)> 1/2 - a. 

If we put a = 1/4, we get 

P((S m - 0(%) < -k) > 1/4, 

and 

P((S m - (f>{k))q >k)> 1/4. 



We are now ready to prove Part b) of Theorem 12.51 
Proof [Theorem I2.5[ Part b)] 

For the process {Yj} as defined above, let n\ = l,m k = n k + 4>{n k ) and 
n k+i — m k + ^(wfc). Then define 

Ac = (^n fc +i H h y mfc - (m fc - n fc )g < -n fc ), 

-Efc = (Xm fe +i H h ^n fc+1 - (n fc+ i - m k )q > m k ), 

Cr = Dk n -Efc- 

Since {C^} involve non-overlapping subsequences of Y,, they are independent events. 
Also D k and £& are independent. Now note that 



Y nk +i H h Vm fc - (m fe - n fc )g < -n fe => 

Y x -\ h F mfe < -n k + (m k - n k )q + n k => 

- m k -n k 

Y mh < q < q => 

m k 

lQF n<Y (p) =rq Fn . x =0^ 

{C fc , io.} C {lq Fn . Y (p) = rq F = 0, i.o.}. 
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Similarly, 

Y mk+1 H h y„ fc+1 - (n fc+ i - m fe )g > m fc 

=>- Yi H h F nfe+1 > (nfc+i - m k )q + m k 

_^ v ^ m k + (n k +i ~ m k )q 

=> ^n fc+1 > >q=l-p 

n k +i 

=>hF n , Y (p) =rq FniY (p) = 1 
=> {C fc , i.o.} C {lq Fn , Y (p) = rq FnY {p) = 1, z.o.}. 

Let us compute the probability of C^: 

P(C fe ) = 

■P0^ fc +i H h F mfc - (m fc - n fe )g < -n k ) x 

P(y mfe+ i H h Y nk+1 - (n k+ i - m k )q > m k ) = 

P(Y 1 + ■■■ + r 0(nfc) - 4>(n k )q < -n k )x 
P (Y 1 + ... + Y Hmk) - <j)(m k )q > m k ) > 1/4.1/4 = 1/16. 

We conclude that 

oo 

Y,P(C k ) = oo. 
fc=l 

By the Borel-Cantelli Lemma, P(C k , i.o.) = 1. We conclude that 

p ( l( lF n Av) = rq Fn<Y (p) = 0, i.o.) = 1, 

and 

p {k Fn , Y ip) = r lFn, Y (p) = 1, *-o.) = 1. 
Hence, 

p ({ l( lF n Av) = rq Fn<Y (p) = 0, i.o.} n 0gjr n , r (p) = rq Fn<Y (p) = 1, i.o.}) = 1. 



Proof (Theorem 12.51 part (c)) 

Suppose that rqp(p) = x\ ^ lq F {p) = Xi and a is an arbitrary real number. Let 

h = X2 — x\. We define a new chain Y as follows: 



11 



y __ , - • X i < l QF X (?) 

-/i Xi >rq Fx (p). 

(See Figured)) Then Yi, Y2, • • • is an i.i.d sample. We drop the index i from Y^ and 
Xi in the following for simplicity and since the Yi (as well as the Xi) are identically 
distributed. We claim 

lq FY Y(p) = rq Fy (p) = lq Fx (p). 
To prove lqF Y (j>) = Iqf x (p), note that 

Mk Fx (p)) = p(y < IqfM) > P(X < lq Fx (p)) >P^ WfAp) < IqfAp)- 

(The first inequality is because Y < X.) Moreover for any y < lq Fx (p), F Y (y) = 
Fx(y) < P- (Since X, Y < lq Fx {p) =>- X = Y.) Hence lqF Y {p) > Iqf x (p) an d 
we are done. To show rq Fy (p) = lqp x (p), note that rq Fy (p) > lq Fy (p) = lqF x {p)- 
It only remains to show that rq Fy (p) < lqF x {p)- Suppose y > lq Fx {p) and let 
5 = y — lqp x (p) > 0. First note that 

P({Y<lq Fx (p) + 5}) = 

P{{Y < lq Fx (p) + 5 and X > rq Fx (p)} U 

{Y < lq Fx (p) + 6 and X < lq Fx (p)}) = 

P({X -h< lq Fx (p) + 5 and X > rg Fx (p)} U 

{X < lq Fx (p) + 5 and X < Zg Fx (p)}) = 

P({rg Fx (p) < X < rg Fx (p) + 6} U {X < ^ x (p)}) = 

P({X<rq Fx (p) + 6}). 



Hence 



F Y (y) = P{Y < lq Fx (p) + 5) = P(X < rq Fx (p) + S) > p => 
rq FY (p) < V, Vy >lq Fx (p)- 

We conclude that rq Fy (p) < lq FY (p). 

To complete the proof of part (c) observe that for every e > 0, we may suppose that 

kF n , Y (p) e (q FY (p) ~ e, g Fy (p) + e). Then 

lQF n ,x(p),rq Fn Jp) G (^ x (p) - e,rq Fx {p) + e). (1) 
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This is because from lqF nY (p) e (Qf y (p) ~ e ; Qf y (p) + e ); we ma y conclude that 

F n ,Y(qF Y (p) + e) > p =► F n>x (rq Fx (p) + e) > p ^ 
kF niX (p),rq Fn , x (p) < rq Fx (p) + e, 

and 

F n ,Y(qF Y (p) - e) < p => F nx (lq Fx (p) - e) < p => 

hF n , x (p),rq Fn<x (p) > lq Fx {p) - e. 



But by part (a) of Theorem 12.51 lq FnY (p) — > qF Y (p) and rq FnY (p) — > q Fy (p). Hence 
for given e > there exists an integer iV such that for any n > N, lq Fn Y (p) G 
{q FY {p) — e, q F y{p) + e). By ([I]), we have shown that for every e > there exists iV 
such that for every n > N 

QF niX (p),rq FnX (p) G (lq Fx (p) - e,rq Fx (p) + e), 

since 

P(Xj G (lq Fx (p),rqF x (p)) for some i G N) = 0. 

We can conclude that 

p (kF niX (p) e (hF x (p),rq Fx (p)) for some z G N) = 

and 

p i. r( lF^ x {p) G (lqF x (p),rq Fx (p)) for some i G N) = 0. 

Hence with probability 1 

<lF nt M, rq Fn , x (p) e (lq Fx {p) - e, ^ x (p)] u Nf x (p), r <?F x (p) + e). 
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Fig. 1: The solid line is the distribution function of {Xj}. Note that for the 
distribution of the X; L and p = 0.5, lqF x (p) = 0,rg^ x (p) = 3. Let 
h = rq(p) — lq(p) = 3. The dotted line is the distribution function of the 
{Yi} which coincides with that of {X{\ to the left of lqF x (p) an d is a back- 
ward shift of 3 units for values greater than rqp x (p). Note that for the {Y^}, 
lQF Y (p) =rq FY (p) = 1. 
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